Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 6566 |
| Missing cells | 33976 |
| Missing cells (%) | 22.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.2 MiB |
| Average record size in memory | 184.0 B |
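The derived figures in this table can be cross-checked from the raw counts. The sketch below assumes the missing-cell percentage is taken over all cells (variables × observations) and that total memory is roughly the average record size times the number of rows; the variable names are illustrative and not part of the report.

```python
# Counts copied from the "Dataset statistics" table.
n_variables = 23
n_observations = 6566
missing_cells = 33976

# Assumption: the percentage is computed over every cell (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# The reported average record size is rounded, so this is only approximate.
avg_record_bytes = 184.0
total_mib = avg_record_bytes * n_observations / 2**20

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"approx. total size: {total_mib:.1f} MiB")
```

Both results round to the values shown in the table (22.5% and 1.2 MiB), which supports the assumed denominators.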
Variable types
| Numeric | 1 |
|---|---|
| Text | 8 |
| Categorical | 10 |
| Unsupported | 4 |
NIVEL has constant value "DIVERSIFICADO" | Constant |
Unnamed: 0 is highly overall correlated with DEPARTAMENTO and 2 other fields | High correlation |
DEPARTAMENTO is highly overall correlated with Unnamed: 0 and 2 other fields | High correlation |
JORNADA is highly overall correlated with PLAN | High correlation |
PLAN is highly overall correlated with JORNADA | High correlation |
DEPARTAMENTAL is highly overall correlated with Unnamed: 0 and 2 other fields | High correlation |
ZONA is highly overall correlated with Unnamed: 0 and 2 other fields | High correlation |
SECTOR is highly imbalanced (65.6%) | Imbalance |
AREA is highly imbalanced (59.2%) | Imbalance |
STATUS is highly imbalanced (51.6%) | Imbalance |
MODALIDAD is highly imbalanced (83.4%) | Imbalance |
PLAN is highly imbalanced (55.1%) | Imbalance |
DISTRITO has 163 (2.5%) missing values | Missing |
TELEFONO has 901 (13.7%) missing values | Missing |
SUPERVISOR has 163 (2.5%) missing values | Missing |
DIRECTOR has 1413 (21.5%) missing values | Missing |
CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN DEPARTA has 6566 (100.0%) missing values | Missing |
CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN has 6566 (100.0%) missing values | Missing |
CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN has 6566 (100.0%) missing values | Missing |
CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN DE has 6566 (100.0%) missing values | Missing |
ZONA has 5030 (76.6%) missing values | Missing |
Unnamed: 0 has unique values | Unique |
CODIGO has unique values | Unique |
CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN DEPARTA is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN DE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
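The report does not say how the imbalance percentages in the alerts are computed. One plausible reading, stated here as an assumption rather than as the tool's documented behavior, is one minus the Shannon entropy of the class counts normalized by the maximum entropy for that number of classes. The function name and the example counts below are hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy of the class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one
    class dominates. This is an assumed definition, not one stated
    in the report.
    """
    k = len(counts)
    if k <= 1:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Hypothetical class counts, not taken from the dataset.
balanced = [250, 250, 250, 250]
skewed = [970, 10, 10, 10]

print(round(imbalance_score(balanced), 3))
print(round(imbalance_score(skewed), 3))
```

Under this definition a column is flagged when a few categories hold most of the rows, which matches how the SECTOR, AREA, STATUS, MODALIDAD and PLAN alerts read.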
Reproduction
| Analysis started | 2023-07-31 22:27:47.865968 |
|---|---|
| Analysis finished | 2023-07-31 22:27:53.201520 |
| Duration | 5.34 seconds |
| Software version | ydata-profiling v4.3.2 |
| Download configuration | config.json |
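The reported duration follows directly from the two timestamps. A minimal check using only the standard library, with the timestamp format string chosen to match the values shown above:

```python
from datetime import datetime

# Format matching the timestamps in the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-07-31 22:27:47.865968", FMT)
finished = datetime.strptime("2023-07-31 22:27:53.201520", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```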
Unnamed: 0
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 6566 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4095.4781 |
| Minimum | 0 |
| Maximum | 15983 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 51.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 328.25 |
| Q1 | 1643.25 |
| median | 3286.5 |
| Q3 | 4939.75 |
| 95-th percentile | 15650.75 |
| Maximum | 15983 |
| Range | 15983 |
| Interquartile range (IQR) | 3296.5 |
Descriptive statistics
| Standard deviation | 3921.1605 |
|---|---|
| Coefficient of variation (CV) | 0.95743657 |
| Kurtosis | 3.7299514 |
| Mean | 4095.4781 |
| Median Absolute Deviation (MAD) | 1648.5 |
| Skewness | 2.051825 |
| Sum | 26890909 |
| Variance | 15375499 |
| Monotonicity | Strictly increasing |
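Several of the statistics above are simple functions of the others. The sketch below recomputes them from the rounded values printed in the tables, so small differences in the last digits are expected; the variable names are illustrative.

```python
# Values copied from the quantile and descriptive tables for "Unnamed: 0".
minimum, maximum = 0, 15983
q1, q3 = 1643.25, 4939.75
mean, std = 4095.4781, 3921.1605

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance (approximate: std is rounded)

print(value_range, iqr, round(cv, 6), round(variance))
```

The range and IQR reproduce exactly; the CV and variance agree with the table to within the rounding of the inputs.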
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 4385 | 1 | < 0.1% |
| 4395 | 1 | < 0.1% |
| 4394 | 1 | < 0.1% |
| 4393 | 1 | < 0.1% |
| 4392 | 1 | < 0.1% |
| 4391 | 1 | < 0.1% |
| 4390 | 1 | < 0.1% |
| 4389 | 1 | < 0.1% |
| 4388 | 1 | < 0.1% |
| Other values (6556) | 6556 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 15983 | 1 | < 0.1% |
| 15982 | 1 | < 0.1% |
| 15981 | 1 | < 0.1% |
| 15980 | 1 | < 0.1% |
| 15979 | 1 | < 0.1% |
| 15978 | 1 | < 0.1% |
| 15977 | 1 | < 0.1% |
| 15976 | 1 | < 0.1% |
| 15975 | 1 | < 0.1% |
| 15974 | 1 | < 0.1% |
CODIGO
Text
UNIQUE 
| Distinct | 6566 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 85358 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 6566 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 16-01-0138-46 |
|---|---|
| 2nd row | 16-01-0139-46 |
| 3rd row | 16-01-0140-46 |
| 4th row | 16-01-0141-46 |
| 5th row | 16-01-0142-46 |
| Value | Count | Frequency (%) |
| 16-01-0138-46 | 1 | < 0.1% |
| 16-01-0557-46 | 1 | < 0.1% |
| 16-01-0143-46 | 1 | < 0.1% |
| 16-01-0145-46 | 1 | < 0.1% |
| 16-01-0147-46 | 1 | < 0.1% |
| 16-01-0150-46 | 1 | < 0.1% |
| 16-01-0155-46 | 1 | < 0.1% |
| 16-01-0428-46 | 1 | < 0.1% |
| 16-01-0471-46 | 1 | < 0.1% |
| 16-01-0710-46 | 1 | < 0.1% |
| Other values (6556) | 6556 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 19698 | |
| 0 | 18982 | |
| 1 | 10032 | |
| 4 | 9228 | |
| 6 | 9086 | |
| 2 | 4507 | 5.3% |
| 3 | 3152 | 3.7% |
| 5 | 3080 | 3.6% |
| 8 | 2944 | 3.4% |
| 7 | 2486 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 65660 | |
| Dash Punctuation | 19698 | 23.1% |
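The category names in this table correspond to Unicode general categories ("Decimal Number" is `Nd`, "Dash Punctuation" is `Pd`). A small sketch using the standard-library `unicodedata` module on the five sample CODIGO values from the report shows how such counts can be obtained; it illustrates the idea and is not necessarily how the profiling tool implements it.

```python
import unicodedata
from collections import Counter

# The five sample CODIGO values shown in the report.
samples = [
    "16-01-0138-46",
    "16-01-0139-46",
    "16-01-0140-46",
    "16-01-0141-46",
    "16-01-0142-46",
]

# Map every character to its Unicode general category code.
categories = Counter(
    unicodedata.category(ch) for code in samples for ch in code
)
print(categories)
```

Each 13-character code holds 10 digits and 3 dashes, so across all 6566 rows this gives 65660 decimal numbers and 19698 dashes, matching the table.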
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18982 | |
| 1 | 10032 | |
| 4 | 9228 | |
| 6 | 9086 | |
| 2 | 4507 | 6.9% |
| 3 | 3152 | 4.8% |
| 5 | 3080 | 4.7% |
| 8 | 2944 | 4.5% |
| 7 | 2486 | 3.8% |
| 9 | 2163 | 3.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19698 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 85358 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 19698 | |
| 0 | 18982 | |
| 1 | 10032 | |
| 4 | 9228 | |
| 6 | 9086 | |
| 2 | 4507 | 5.3% |
| 3 | 3152 | 3.7% |
| 5 | 3080 | 3.6% |
| 8 | 2944 | 3.4% |
| 7 | 2486 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85358 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 19698 | |
| 0 | 18982 | |
| 1 | 10032 | |
| 4 | 9228 | |
| 6 | 9086 | |
| 2 | 4507 | 5.3% |
| 3 | 3152 | 3.7% |
| 5 | 3080 | 3.6% |
| 8 | 2944 | 3.4% |
| 7 | 2486 | 2.9% |
DISTRITO
Text
MISSING 
| Distinct | 443 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 163 |
| Missing (%) | 2.5% |
| Memory size | 51.4 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.9854756 |
| Min length | 3 |
Characters and Unicode
| Total characters | 38325 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 51 |
|---|---|
| Unique (%) | 0.8% |
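The DISTRITO percentages are consistent with a denominator of non-missing rows rather than all rows: dividing by all 6566 rows would give a distinct share of about 6.7%, not the 6.9% reported. The check below works from the counts in the tables above; the choice of denominator is an inference from the numbers, not something the report states.

```python
# Counts copied from the DISTRITO overview tables.
n_rows, n_missing = 6566, 163
distinct, unique, total_chars = 443, 51, 38325

# Assumption: ratios are computed over non-missing rows only.
non_missing = n_rows - n_missing

distinct_pct = 100 * distinct / non_missing
unique_pct = 100 * unique / non_missing
mean_length = total_chars / non_missing

print(f"distinct: {distinct_pct:.1f}%")
print(f"unique: {unique_pct:.1f}%")
print(f"mean length: {mean_length:.7f}")
```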
Sample
| 1st row | 16-031 |
|---|---|
| 2nd row | 16-031 |
| 3rd row | 16-031 |
| 4th row | 16-005 |
| 5th row | 16-005 |
| Value | Count | Frequency (%) |
| 01-403 | 242 | 3.8% |
| 05-033 | 159 | 2.5% |
| 01-411 | 150 | 2.3% |
| 18-008 | 128 | 2.0% |
| 01-409 | 102 | 1.6% |
| 05-007 | 100 | 1.6% |
| 18-039 | 98 | 1.5% |
| 13-004 | 92 | 1.4% |
| 10-019 | 91 | 1.4% |
| 01-641 | 87 | 1.4% |
| Other values (433) | 5154 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11029 | |
| 1 | 7785 | |
| - | 6403 | |
| 2 | 2934 | 7.7% |
| 3 | 2565 | 6.7% |
| 4 | 1910 | 5.0% |
| 6 | 1553 | 4.1% |
| 5 | 1409 | 3.7% |
| 8 | 1020 | 2.7% |
| 9 | 945 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 31922 | |
| Dash Punctuation | 6403 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11029 | |
| 1 | 7785 | |
| 2 | 2934 | 9.2% |
| 3 | 2565 | 8.0% |
| 4 | 1910 | 6.0% |
| 6 | 1553 | 4.9% |
| 5 | 1409 | 4.4% |
| 8 | 1020 | 3.2% |
| 9 | 945 | 3.0% |
| 7 | 772 | 2.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6403 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 38325 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 11029 | |
| 1 | 7785 | |
| - | 6403 | |
| 2 | 2934 | 7.7% |
| 3 | 2565 | 6.7% |
| 4 | 1910 | 5.0% |
| 6 | 1553 | 4.1% |
| 5 | 1409 | 3.7% |
| 8 | 1020 | 2.7% |
| 9 | 945 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38325 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11029 | |
| 1 | 7785 | |
| - | 6403 | |
| 2 | 2934 | 7.7% |
| 3 | 2565 | 6.7% |
| 4 | 1910 | 5.0% |
| 6 | 1553 | 4.1% |
| 5 | 1409 | 3.7% |
| 8 | 1020 | 2.7% |
| 9 | 945 | 2.5% |
DEPARTAMENTO
Categorical
HIGH CORRELATION 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.6749924 |
| Min length | 6 |
Characters and Unicode
| Total characters | 63526 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ALTA VERAPAZ |
|---|---|
| 2nd row | ALTA VERAPAZ |
| 3rd row | ALTA VERAPAZ |
| 4th row | ALTA VERAPAZ |
| 5th row | ALTA VERAPAZ |
Common Values
| Value | Count | Frequency (%) |
| GUATEMALA | 2970 | |
| ESCUINTLA | 599 | 9.1% |
| HUEHUETENANGO | 495 | 7.5% |
| SUCHITEPEQUEZ | 377 | 5.7% |
| IZABAL | 360 | 5.5% |
| CHIMALTENANGO | 349 | 5.3% |
| ALTA VERAPAZ | 348 | 5.3% |
| JUTIAPA | 320 | 4.9% |
| CHIQUIMULA | 172 | 2.6% |
| JALAPA | 149 | 2.3% |
| Other values (4) | 427 | 6.5% |
Words
| Value | Count | Frequency (%) |
| guatemala | 2970 | |
| escuintla | 599 | 8.4% |
| huehuetenango | 495 | 6.9% |
| verapaz | 468 | 6.5% |
| suchitepequez | 377 | 5.3% |
| izabal | 360 | 5.0% |
| chimaltenango | 349 | 4.9% |
| alta | 348 | 4.9% |
| jutiapa | 320 | 4.5% |
| chiquimula | 172 | 2.4% |
| Other values (6) | 698 | 9.8% |
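The lowercase entries in this table look like per-word counts: each value is lowercased and split on whitespace, so a two-word department name contributes to two rows. That would explain why "verapaz" (468) has a higher count than "alta" (348). A minimal sketch of that tokenization, assuming a plain whitespace split and using illustrative values rather than the real column:

```python
from collections import Counter

# Illustrative values only; the real column has 6566 rows.
values = ["ALTA VERAPAZ", "BAJA VERAPAZ", "GUATEMALA", "ALTA VERAPAZ"]

# Lowercase each value, split on whitespace, and count every word.
words = Counter(word for value in values for word in value.lower().split())

for word, count in words.most_common():
    print(word, count)
```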
Most occurring characters
| Value | Count | Frequency (%) |
| A | 15018 | |
| E | 7246 | |
| U | 5977 | 9.4% |
| T | 5638 | 8.9% |
| L | 5069 | 8.0% |
| G | 3936 | 6.2% |
| M | 3491 | 5.5% |
| N | 2467 | 3.9% |
| I | 2439 | 3.8% |
| H | 1888 | 3.0% |
| Other values (11) | 10357 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 62936 | |
| Space Separator | 590 | 0.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15018 | |
| E | 7246 | |
| U | 5977 | 9.5% |
| T | 5638 | 9.0% |
| L | 5069 | 8.1% |
| G | 3936 | 6.3% |
| M | 3491 | 5.5% |
| N | 2467 | 3.9% |
| I | 2439 | 3.9% |
| H | 1888 | 3.0% |
| Other values (10) | 9767 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 590 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 62936 | |
| Common | 590 | 0.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 15018 | |
| E | 7246 | |
| U | 5977 | 9.5% |
| T | 5638 | 9.0% |
| L | 5069 | 8.1% |
| G | 3936 | 6.3% |
| M | 3491 | 5.5% |
| N | 2467 | 3.9% |
| I | 2439 | 3.9% |
| H | 1888 | 3.0% |
| Other values (10) | 9767 |
Common
| Value | Count | Frequency (%) |
| (space) | 590 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63526 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 15018 | |
| E | 7246 | |
| U | 5977 | 9.4% |
| T | 5638 | 8.9% |
| L | 5069 | 8.0% |
| G | 3936 | 6.2% |
| M | 3491 | 5.5% |
| N | 2467 | 3.9% |
| I | 2439 | 3.8% |
| H | 1888 | 3.0% |
| Other values (11) | 10357 |
MUNICIPIO
Text
| Distinct | 189 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 12.126866 |
| Min length | 5 |
Characters and Unicode
| Total characters | 79625 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | COBAN |
|---|---|
| 2nd row | COBAN |
| 3rd row | COBAN |
| 4th row | COBAN |
| 5th row | COBAN |
| Value | Count | Frequency (%) |
| ciudad | 1536 | 13.5% |
| capital | 1536 | 13.5% |
| san | 900 | 7.9% |
| villa | 431 | 3.8% |
| mixco | 420 | 3.7% |
| nueva | 400 | 3.5% |
| santa | 251 | 2.2% |
| chimaltenango | 170 | 1.5% |
| mazatenango | 167 | 1.5% |
| escuintla | 164 | 1.4% |
| Other values (211) | 5433 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 15200 | |
| I | 6708 | 8.4% |
| C | 6068 | 7.6% |
| T | 5052 | 6.3% |
| L | 5019 | 6.3% |
| N | 4964 | 6.2% |
| (space) | 4842 | 6.1% |
| U | 4830 | 6.1% |
| E | 3967 | 5.0% |
| O | 3440 | 4.3% |
| Other values (15) | 19535 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 74783 | |
| Space Separator | 4842 | 6.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15200 | |
| I | 6708 | |
| C | 6068 | 8.1% |
| T | 5052 | 6.8% |
| L | 5019 | 6.7% |
| N | 4964 | 6.6% |
| U | 4830 | 6.5% |
| E | 3967 | 5.3% |
| O | 3440 | 4.6% |
| D | 3392 | 4.5% |
| Other values (14) | 16143 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4842 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 74783 | |
| Common | 4842 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 15200 | |
| I | 6708 | |
| C | 6068 | 8.1% |
| T | 5052 | 6.8% |
| L | 5019 | 6.7% |
| N | 4964 | 6.6% |
| U | 4830 | 6.5% |
| E | 3967 | 5.3% |
| O | 3440 | 4.6% |
| D | 3392 | 4.5% |
| Other values (14) | 16143 |
Common
| Value | Count | Frequency (%) |
| (space) | 4842 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79625 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 15200 | |
| I | 6708 | 8.4% |
| C | 6068 | 7.6% |
| T | 5052 | 6.3% |
| L | 5019 | 6.3% |
| N | 4964 | 6.2% |
| (space) | 4842 | 6.1% |
| U | 4830 | 6.1% |
| E | 3967 | 5.0% |
| O | 3440 | 4.3% |
| Other values (15) | 19535 |
ESTABLECIMIENTO
Text
| Distinct | 3526 |
|---|---|
| Distinct (%) | 53.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
Length
| Max length | 125 |
|---|---|
| Median length | 103 |
| Mean length | 39.777947 |
| Min length | 3 |
Characters and Unicode
| Total characters | 261182 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 2260 |
|---|---|
| Unique (%) | 34.4% |
Sample
| 1st row | COLEGIO COBAN |
|---|---|
| 2nd row | COLEGIO PARTICULAR MIXTO VERAPAZ |
| 3rd row | COLEGIO "LA INMACULADA" |
| 4th row | ESCUELA NACIONAL DE CIENCIAS COMERCIALES |
| 5th row | INSTITUTO NORMAL MIXTO DEL NORTE 'EMILIO ROSALES PONCE' |
| Value | Count | Frequency (%) |
| de | 2657 | 7.7% |
| colegio | 2495 | 7.3% |
| mixto | 1922 | 5.6% |
| instituto | 1769 | 5.1% |
| liceo | 1347 | 3.9% |
| educacion | 976 | 2.8% |
| privado | 954 | 2.8% |
| centro | 842 | 2.4% |
| diversificada | 547 | 1.6% |
| y | 540 | 1.6% |
| Other values (2443) | 20358 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 27849 | |
| I | 27322 | |
| O | 25757 | |
| E | 23018 | 8.8% |
| A | 22252 | 8.5% |
| C | 18444 | 7.1% |
| T | 16064 | 6.2% |
| N | 15063 | 5.8% |
| L | 12539 | 4.8% |
| R | 11954 | 4.6% |
| Other values (39) | 60920 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 228339 | |
| Space Separator | 27849 | 10.7% |
| Other Punctuation | 3854 | 1.5% |
| Dash Punctuation | 468 | 0.2% |
| Decimal Number | 344 | 0.1% |
| Open Punctuation | 163 | 0.1% |
| Close Punctuation | 162 | 0.1% |
| Modifier Symbol | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 27322 | |
| O | 25757 | |
| E | 23018 | |
| A | 22252 | |
| C | 18444 | 8.1% |
| T | 16064 | 7.0% |
| N | 15063 | 6.6% |
| L | 12539 | 5.5% |
| R | 11954 | 5.2% |
| D | 9555 | 4.2% |
| Other values (16) | 46371 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 125 | |
| 0 | 70 | |
| 1 | 54 | |
| 3 | 31 | 9.0% |
| 4 | 19 | 5.5% |
| 7 | 15 | 4.4% |
| 6 | 11 | 3.2% |
| 8 | 7 | 2.0% |
| 9 | 6 | 1.7% |
| 5 | 6 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 2336 | |
| . | 719 | 18.7% |
| ' | 707 | 18.3% |
| , | 77 | 2.0% |
| & | 7 | 0.2% |
| / | 6 | 0.2% |
| % | 1 | < 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27849 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 468 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 163 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 162 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 228339 | |
| Common | 32843 | 12.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 27322 | |
| O | 25757 | |
| E | 23018 | |
| A | 22252 | |
| C | 18444 | 8.1% |
| T | 16064 | 7.0% |
| N | 15063 | 6.6% |
| L | 12539 | 5.5% |
| R | 11954 | 5.2% |
| D | 9555 | 4.2% |
| Other values (16) | 46371 |
Common
| Value | Count | Frequency (%) |
| (space) | 27849 | |
| " | 2336 | 7.1% |
| . | 719 | 2.2% |
| ' | 707 | 2.2% |
| - | 468 | 1.4% |
| ( | 163 | 0.5% |
| ) | 162 | 0.5% |
| 2 | 125 | 0.4% |
| , | 77 | 0.2% |
| 0 | 70 | 0.2% |
| Other values (13) | 167 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 261182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 27849 | |
| I | 27322 | |
| O | 25757 | |
| E | 23018 | 8.8% |
| A | 22252 | 8.5% |
| C | 18444 | 7.1% |
| T | 16064 | 6.2% |
| N | 15063 | 5.8% |
| L | 12539 | 4.8% |
| R | 11954 | 4.6% |
| Other values (39) | 60920 |
DIRECCION
Text
| Distinct | 4216 |
|---|---|
| Distinct (%) | 64.6% |
| Missing | 42 |
| Missing (%) | 0.6% |
| Memory size | 51.4 KiB |
Length
| Max length | 110 |
|---|---|
| Median length | 90 |
| Mean length | 29.358063 |
| Min length | 4 |
Characters and Unicode
| Total characters | 191532 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3113 |
|---|---|
| Unique (%) | 47.7% |
Sample
| 1st row | KM.2 SALIDA A SAN JUAN CHAMELCO ZONA 8 |
|---|---|
| 2nd row | KM 209.5 ENTRADA A LA CIUDAD |
| 3rd row | 7A. AVENIDA 11-109 ZONA 6 |
| 4th row | 2A CALLE 11-10 ZONA 2 |
| 5th row | 3A AVE 6-23 ZONA 11 |
| Value | Count | Frequency (%) |
| zona | 2829 | 7.9% |
| calle | 2158 | 6.0% |
| avenida | 1682 | 4.7% |
| 1 | 1217 | 3.4% |
| colonia | 931 | 2.6% |
| barrio | 810 | 2.3% |
| aldea | 693 | 1.9% |
| san | 662 | 1.8% |
| el | 641 | 1.8% |
| la | 443 | 1.2% |
| Other values (2879) | 23864 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 29406 | |
| A | 27517 | |
| E | 12045 | 6.3% |
| L | 11999 | 6.3% |
| O | 11429 | 6.0% |
| N | 11168 | 5.8% |
| I | 8816 | 4.6% |
| C | 7408 | 3.9% |
| R | 6805 | 3.6% |
| 1 | 5391 | 2.8% |
| Other values (39) | 59548 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 128153 | |
| Space Separator | 29406 | 15.4% |
| Decimal Number | 23603 | 12.3% |
| Other Punctuation | 6355 | 3.3% |
| Dash Punctuation | 3966 | 2.1% |
| Lowercase Letter | 17 | < 0.1% |
| Open Punctuation | 16 | < 0.1% |
| Close Punctuation | 16 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 27517 | |
| E | 12045 | |
| L | 11999 | |
| O | 11429 | |
| N | 11168 | |
| I | 8816 | 6.9% |
| C | 7408 | 5.8% |
| R | 6805 | 5.3% |
| D | 4594 | 3.6% |
| T | 4250 | 3.3% |
| Other values (16) | 22122 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5391 | |
| 2 | 3213 | |
| 3 | 2777 | |
| 4 | 2438 | |
| 5 | 2230 | |
| 0 | 1943 | 8.2% |
| 6 | 1732 | 7.3% |
| 7 | 1486 | 6.3% |
| 9 | 1207 | 5.1% |
| 8 | 1186 | 5.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3732 | |
| , | 2109 | |
| " | 383 | 6.0% |
| ' | 86 | 1.4% |
| / | 29 | 0.5% |
| # | 15 | 0.2% |
| ; | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 16 | |
| o | 1 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 29406 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3966 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 128170 | |
| Common | 63362 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 27517 | |
| E | 12045 | |
| L | 11999 | |
| O | 11429 | |
| N | 11168 | |
| I | 8816 | 6.9% |
| C | 7408 | 5.8% |
| R | 6805 | 5.3% |
| D | 4594 | 3.6% |
| T | 4250 | 3.3% |
| Other values (18) | 22139 |
Common
| Value | Count | Frequency (%) |
| (space) | 29406 | |
| 1 | 5391 | 8.5% |
| - | 3966 | 6.3% |
| . | 3732 | 5.9% |
| 2 | 3213 | 5.1% |
| 3 | 2777 | 4.4% |
| 4 | 2438 | 3.8% |
| 5 | 2230 | 3.5% |
| , | 2109 | 3.3% |
| 0 | 1943 | 3.1% |
| Other values (11) | 6157 | 9.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 191532 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 29406 | |
| A | 27517 | |
| E | 12045 | 6.3% |
| L | 11999 | 6.3% |
| O | 11429 | 6.0% |
| N | 11168 | 5.8% |
| I | 8816 | 4.6% |
| C | 7408 | 3.9% |
| R | 6805 | 3.6% |
| 1 | 5391 | 2.8% |
| Other values (39) | 59548 |
TELEFONO
Text
MISSING 
| Distinct | 3382 |
|---|---|
| Distinct (%) | 59.7% |
| Missing | 901 |
| Missing (%) | 13.7% |
| Memory size | 51.4 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 45320 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 2237 |
|---|---|
| Unique (%) | 39.5% |
Sample
| 1st row | 77945104 |
|---|---|
| 2nd row | 77367402 |
| 3rd row | 78232301 |
| 4th row | 79514215 |
| 5th row | 79521468 |
| Value | Count | Frequency (%) |
| 22067425 | 21 | 0.4% |
| 79480009 | 14 | 0.2% |
| 22093200 | 12 | 0.2% |
| 77746400 | 11 | 0.2% |
| 45353648 | 11 | 0.2% |
| 59304894 | 11 | 0.2% |
| 24637777 | 10 | 0.2% |
| 78899679 | 10 | 0.2% |
| 22322912 | 10 | 0.2% |
| 78394519 | 9 | 0.2% |
| Other values (3374) | 5549 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6312 | |
| 4 | 5266 | |
| 7 | 4868 | |
| 3 | 4690 | |
| 5 | 4667 | |
| 0 | 4117 | |
| 8 | 4093 | |
| 6 | 3917 | |
| 9 | 3733 | |
| 1 | 3622 | |
| Other values (6) | 35 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 45285 | |
| Dash Punctuation | 18 | < 0.1% |
| Other Punctuation | 8 | < 0.1% |
| Space Separator | 7 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6312 | |
| 4 | 5266 | |
| 7 | 4868 | |
| 3 | 4690 | |
| 5 | 4667 | |
| 0 | 4117 | |
| 8 | 4093 | |
| 6 | 3917 | |
| 9 | 3733 | |
| 1 | 3622 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 | |
| / | 1 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| Y | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 45318 | |
| Latin | 2 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6312 | |
| 4 | 5266 | |
| 7 | 4868 | |
| 3 | 4690 | |
| 5 | 4667 | |
| 0 | 4117 | |
| 8 | 4093 | |
| 6 | 3917 | |
| 9 | 3733 | |
| 1 | 3622 | |
| Other values (4) | 33 | 0.1% |
Latin
| Value | Count | Frequency (%) |
| E | 1 | |
| Y | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45320 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6312 | |
| 4 | 5266 | |
| 7 | 4868 | |
| 3 | 4690 | |
| 5 | 4667 | |
| 0 | 4117 | |
| 8 | 4093 | |
| 6 | 3917 | |
| 9 | 3733 | |
| 1 | 3622 | |
| Other values (6) | 35 | 0.1% |
SUPERVISOR
Text
MISSING 
| Distinct | 417 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 163 |
| Missing (%) | 2.5% |
| Memory size | 51.4 KiB |
Length
| Max length | 63 |
|---|---|
| Median length | 43 |
| Mean length | 29.576136 |
| Min length | 14 |
Characters and Unicode
| Total characters | 189376 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 40 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | MERCEDES JOSEFINA TORRES GALVEZ |
|---|---|
| 2nd row | MERCEDES JOSEFINA TORRES GALVEZ |
| 3rd row | MERCEDES JOSEFINA TORRES GALVEZ |
| 4th row | RUDY ADOLFO TOT OCH |
| 5th row | RUDY ADOLFO TOT OCH |
| Value | Count | Frequency (%) |
| de | 1596 | 5.7% |
| martinez | 543 | 2.0% |
| gonzalez | 444 | 1.6% |
| leon | 424 | 1.5% |
| lopez | 396 | 1.4% |
| morales | 373 | 1.3% |
| carlos | 368 | 1.3% |
| juan | 353 | 1.3% |
| humberto | 310 | 1.1% |
| hernandez | 299 | 1.1% |
| Other values (866) | 22676 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 24041 | |
| (space) | 21379 | |
| E | 18057 | 9.5% |
| R | 15128 | 8.0% |
| O | 14004 | 7.4% |
| I | 12847 | 6.8% |
| L | 11894 | 6.3% |
| N | 11046 | 5.8% |
| S | 7792 | 4.1% |
| D | 6550 | 3.5% |
| Other values (19) | 46638 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 167867 | |
| Space Separator | 21379 | 11.3% |
| Dash Punctuation | 124 | 0.1% |
| Other Punctuation | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 24041 | |
| E | 18057 | |
| R | 15128 | 9.0% |
| O | 14004 | 8.3% |
| I | 12847 | 7.7% |
| L | 11894 | 7.1% |
| N | 11046 | 6.6% |
| S | 7792 | 4.6% |
| D | 6550 | 3.9% |
| C | 6273 | 3.7% |
| Other values (16) | 40235 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 21379 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 124 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 167867 | |
| Common | 21509 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 24041 | |
| E | 18057 | |
| R | 15128 | 9.0% |
| O | 14004 | 8.3% |
| I | 12847 | 7.7% |
| L | 11894 | 7.1% |
| N | 11046 | 6.6% |
| S | 7792 | 4.6% |
| D | 6550 | 3.9% |
| C | 6273 | 3.7% |
| Other values (16) | 40235 |
Common
| Value | Count | Frequency (%) |
| (space) | 21379 | |
| - | 124 | 0.6% |
| . | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 189376 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 24041 | |
| (space) | 21379 | |
| E | 18057 | 9.5% |
| R | 15128 | 8.0% |
| O | 14004 | 7.4% |
| I | 12847 | 6.8% |
| L | 11894 | 6.3% |
| N | 11046 | 5.8% |
| S | 7792 | 4.1% |
| D | 6550 | 3.5% |
| Other values (19) | 46638 |
DIRECTOR
Text
MISSING 
| Distinct | 3138 |
|---|---|
| Distinct (%) | 60.9% |
| Missing | 1413 |
| Missing (%) | 21.5% |
| Memory size | 51.4 KiB |
Length
| Max length | 55 |
|---|---|
| Median length | 47 |
| Mean length | 28.889385 |
| Min length | 8 |
Characters and Unicode
| Total characters | 148867 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 2102 |
|---|---|
| Unique (%) | 40.8% |
Sample
| 1st row | JULIO CESAR VILLELA AMADO |
|---|---|
| 2nd row | VIRGINA SOLANO SERRANO |
| 3rd row | HECOTR WALDEMAR TOT COY |
| 4th row | LUIS FERNANDO SOTO |
| 5th row | MERCEDES QUIROS QUIROS |
| Value | Count | Frequency (%) |
| de | 1033 | 4.7% |
| lopez | 419 | 1.9% |
| maria | 284 | 1.3% |
| garcia | 264 | 1.2% |
| morales | 245 | 1.1% |
| hernandez | 226 | 1.0% |
| perez | 191 | 0.9% |
| gonzalez | 172 | 0.8% |
| jose | 159 | 0.7% |
| martinez | 154 | 0.7% |
| Other values (2900) | 18734 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 19876 | |
| (space) | 16728 | |
| E | 14329 | 9.6% |
| R | 11929 | 8.0% |
| O | 10562 | 7.1% |
| I | 10091 | 6.8% |
| L | 9240 | 6.2% |
| N | 8673 | 5.8% |
| S | 6055 | 4.1% |
| D | 5449 | 3.7% |
| Other values (23) | 35935 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 131998 | |
| Space Separator | 16728 | 11.2% |
| Dash Punctuation | 70 | < 0.1% |
| Other Punctuation | 69 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 19876 | |
| E | 14329 | |
| R | 11929 | 9.0% |
| O | 10562 | 8.0% |
| I | 10091 | 7.6% |
| L | 9240 | 7.0% |
| N | 8673 | 6.6% |
| S | 6055 | 4.6% |
| D | 5449 | 4.1% |
| C | 4782 | 3.6% |
| Other values (16) | 31012 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 65 | |
| " | 2 | 2.9% |
| , | 2 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 16728 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 70 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 131998 | |
| Common | 16869 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 19876 | |
| E | 14329 | |
| R | 11929 | 9.0% |
| O | 10562 | 8.0% |
| I | 10091 | 7.6% |
| L | 9240 | 7.0% |
| N | 8673 | 6.6% |
| S | 6055 | 4.6% |
| D | 5449 | 4.1% |
| C | 4782 | 3.6% |
| Other values (16) | 31012 |
Common
| Value | Count | Frequency (%) |
| (space) | 16728 | |
| - | 70 | 0.4% |
| . | 65 | 0.4% |
| " | 2 | < 0.1% |
| , | 2 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 148867 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 19876 | |
| (space) | 16728 | |
| E | 14329 | 9.6% |
| R | 11929 | 8.0% |
| O | 10562 | 7.1% |
| I | 10091 | 6.8% |
| L | 9240 | 6.2% |
| N | 8673 | 5.8% |
| S | 6055 | 4.1% |
| D | 5449 | 3.7% |
| Other values (23) | 35935 |
NIVEL
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 85358 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DIVERSIFICADO |
|---|---|
| 2nd row | DIVERSIFICADO |
| 3rd row | DIVERSIFICADO |
| 4th row | DIVERSIFICADO |
| 5th row | DIVERSIFICADO |
Common Values
| Value | Count | Frequency (%) |
| DIVERSIFICADO | 6566 |
Words
| Value | Count | Frequency (%) |
| diversificado | 6566 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 19698 | 23.1% |
| D | 13132 | 15.4% |
| V | 6566 | 7.7% |
| E | 6566 | 7.7% |
| R | 6566 | 7.7% |
| S | 6566 | 7.7% |
| F | 6566 | 7.7% |
| C | 6566 | 7.7% |
| A | 6566 | 7.7% |
| O | 6566 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 85358 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 19698 | 23.1% |
| D | 13132 | 15.4% |
| V | 6566 | 7.7% |
| E | 6566 | 7.7% |
| R | 6566 | 7.7% |
| S | 6566 | 7.7% |
| F | 6566 | 7.7% |
| C | 6566 | 7.7% |
| A | 6566 | 7.7% |
| O | 6566 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 85358 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 19698 | 23.1% |
| D | 13132 | 15.4% |
| V | 6566 | 7.7% |
| E | 6566 | 7.7% |
| R | 6566 | 7.7% |
| S | 6566 | 7.7% |
| F | 6566 | 7.7% |
| C | 6566 | 7.7% |
| A | 6566 | 7.7% |
| O | 6566 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85358 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 19698 | 23.1% |
| D | 13132 | 15.4% |
| V | 6566 | 7.7% |
| E | 6566 | 7.7% |
| R | 6566 | 7.7% |
| S | 6566 | 7.7% |
| F | 6566 | 7.7% |
| C | 6566 | 7.7% |
| A | 6566 | 7.7% |
| O | 6566 | 7.7% |
SECTOR
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
| Value | Count |
|---|---|
| PRIVADO | 5723 |
| OFICIAL | 637 |
| COOPERATIVA | 112 |
| MUNICIPAL | 94 |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.0968626 |
| Min length | 7 |
Characters and Unicode
| Total characters | 46598 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRIVADO |
|---|---|
| 2nd row | PRIVADO |
| 3rd row | PRIVADO |
| 4th row | OFICIAL |
| 5th row | OFICIAL |
Common Values
| Value | Count | Frequency (%) |
| PRIVADO | 5723 | 87.2% |
| OFICIAL | 637 | 9.7% |
| COOPERATIVA | 112 | 1.7% |
| MUNICIPAL | 94 | 1.4% |
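The IMBALANCE flag on SECTOR corresponds to the 65.6% score listed in the report's alerts. The report does not say how that score is derived. The sketch below assumes it is one minus the Shannon entropy of the category counts, normalized by the logarithm of the number of categories. Under that assumption, the counts in this report reproduce the listed scores for SECTOR (65.6%) and AREA (59.2%). The function name `imbalance_score` is illustrative and not taken from any library.

```python
import math


def imbalance_score(counts):
    """Return 1 - H/log(k) for a list of category counts.

    Assumption: the report's imbalance figure is one minus normalized
    Shannon entropy. 0.0 means perfectly balanced classes; values close
    to 1.0 mean a single class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Counts copied from the Common Values tables of this report.
sector_counts = [5723, 637, 112, 94]   # PRIVADO, OFICIAL, COOPERATIVA, MUNICIPAL
area_counts = [5487, 1078, 1]          # URBANA, RURAL, SIN ESPECIFICAR

print(round(imbalance_score(sector_counts) * 100, 1))  # 65.6
print(round(imbalance_score(area_counts) * 100, 1))    # 59.2
```

A score this high for SECTOR simply restates that PRIVADO accounts for roughly seven of every eight rows, so any model or summary stratified by SECTOR will be dominated by private establishments.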
Words
| Value | Count | Frequency (%) |
| privado | 5723 | 87.2% |
| oficial | 637 | 9.7% |
| cooperativa | 112 | 1.7% |
| municipal | 94 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 7297 | 15.7% |
| A | 6678 | 14.3% |
| O | 6584 | 14.1% |
| P | 5929 | 12.7% |
| R | 5835 | 12.5% |
| V | 5835 | 12.5% |
| D | 5723 | 12.3% |
| C | 843 | 1.8% |
| L | 731 | 1.6% |
| F | 637 | 1.4% |
| Other values (5) | 506 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 46598 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 7297 | 15.7% |
| A | 6678 | 14.3% |
| O | 6584 | 14.1% |
| P | 5929 | 12.7% |
| R | 5835 | 12.5% |
| V | 5835 | 12.5% |
| D | 5723 | 12.3% |
| C | 843 | 1.8% |
| L | 731 | 1.6% |
| F | 637 | 1.4% |
| Other values (5) | 506 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 46598 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 7297 | 15.7% |
| A | 6678 | 14.3% |
| O | 6584 | 14.1% |
| P | 5929 | 12.7% |
| R | 5835 | 12.5% |
| V | 5835 | 12.5% |
| D | 5723 | 12.3% |
| C | 843 | 1.8% |
| L | 731 | 1.6% |
| F | 637 | 1.4% |
| Other values (5) | 506 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46598 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 7297 | 15.7% |
| A | 6678 | 14.3% |
| O | 6584 | 14.1% |
| P | 5929 | 12.7% |
| R | 5835 | 12.5% |
| V | 5835 | 12.5% |
| D | 5723 | 12.3% |
| C | 843 | 1.8% |
| L | 731 | 1.6% |
| F | 637 | 1.4% |
| Other values (5) | 506 | 1.1% |
AREA
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
| Value | Count |
|---|---|
| URBANA | 5487 |
| RURAL | 1078 |
| SIN ESPECIFICAR | 1 |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 5.8371916 |
| Min length | 5 |
Characters and Unicode
| Total characters | 38327 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | URBANA |
|---|---|
| 2nd row | URBANA |
| 3rd row | URBANA |
| 4th row | URBANA |
| 5th row | URBANA |
Common Values
| Value | Count | Frequency (%) |
| URBANA | 5487 | 83.6% |
| RURAL | 1078 | 16.4% |
| SIN ESPECIFICAR | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| urbana | 5487 | 83.6% |
| rural | 1078 | 16.4% |
| sin | 1 | < 0.1% |
| especificar | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 12053 | 31.4% |
| R | 7644 | 19.9% |
| U | 6565 | 17.1% |
| N | 5488 | 14.3% |
| B | 5487 | 14.3% |
| L | 1078 | 2.8% |
| I | 3 | < 0.1% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| C | 2 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 38326 | > 99.9% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 12053 | 31.4% |
| R | 7644 | 19.9% |
| U | 6565 | 17.1% |
| N | 5488 | 14.3% |
| B | 5487 | 14.3% |
| L | 1078 | 2.8% |
| I | 3 | < 0.1% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| C | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38326 | > 99.9% |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 12053 | 31.4% |
| R | 7644 | 19.9% |
| U | 6565 | 17.1% |
| N | 5488 | 14.3% |
| B | 5487 | 14.3% |
| L | 1078 | 2.8% |
| I | 3 | < 0.1% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| C | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38327 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 12053 | 31.4% |
| R | 7644 | 19.9% |
| U | 6565 | 17.1% |
| N | 5488 | 14.3% |
| B | 5487 | 14.3% |
| L | 1078 | 2.8% |
| I | 3 | < 0.1% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| C | 2 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
STATUS
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
| Value | Count |
|---|---|
| ABIERTA | 4618 |
| CERRADA TEMPORALMENTE | 1844 |
| TEMPORAL TITULOS | 102 |
| TEMPORAL NOMBRAMIENTO | 2 |
Length
| Max length | 21 |
|---|---|
| Median length | 7 |
| Mean length | 11.075845 |
| Min length | 7 |
Characters and Unicode
| Total characters | 72724 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ABIERTA |
|---|---|
| 2nd row | ABIERTA |
| 3rd row | ABIERTA |
| 4th row | ABIERTA |
| 5th row | ABIERTA |
Common Values
| Value | Count | Frequency (%) |
| ABIERTA | 4618 | 70.3% |
| CERRADA TEMPORALMENTE | 1844 | 28.1% |
| TEMPORAL TITULOS | 102 | 1.6% |
| TEMPORAL NOMBRAMIENTO | 2 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| abierta | 4618 | 54.2% |
| cerrada | 1844 | 21.7% |
| temporalmente | 1844 | 21.7% |
| temporal | 104 | 1.2% |
| titulos | 102 | 1.2% |
| nombramiento | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 14874 | 20.5% |
| E | 12100 | 16.6% |
| R | 10256 | 14.1% |
| T | 8616 | 11.8% |
| I | 4722 | 6.5% |
| B | 4620 | 6.4% |
| M | 3796 | 5.2% |
| O | 2054 | 2.8% |
| L | 2050 | 2.8% |
| (space) | 1948 | 2.7% |
| Other values (6) | 7688 | 10.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 70776 | 97.3% |
| Space Separator | 1948 | 2.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 14874 | 21.0% |
| E | 12100 | 17.1% |
| R | 10256 | 14.5% |
| T | 8616 | 12.2% |
| I | 4722 | 6.7% |
| B | 4620 | 6.5% |
| M | 3796 | 5.4% |
| O | 2054 | 2.9% |
| L | 2050 | 2.9% |
| P | 1948 | 2.8% |
| Other values (5) | 5740 | 8.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1948 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 70776 | 97.3% |
| Common | 1948 | 2.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 14874 | 21.0% |
| E | 12100 | 17.1% |
| R | 10256 | 14.5% |
| T | 8616 | 12.2% |
| I | 4722 | 6.7% |
| B | 4620 | 6.5% |
| M | 3796 | 5.4% |
| O | 2054 | 2.9% |
| L | 2050 | 2.9% |
| P | 1948 | 2.8% |
| Other values (5) | 5740 | 8.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1948 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72724 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 14874 | 20.5% |
| E | 12100 | 16.6% |
| R | 10256 | 14.1% |
| T | 8616 | 11.8% |
| I | 4722 | 6.5% |
| B | 4620 | 6.4% |
| M | 3796 | 5.2% |
| O | 2054 | 2.8% |
| L | 2050 | 2.8% |
| (space) | 1948 | 2.7% |
| Other values (6) | 7688 | 10.6% |
MODALIDAD
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
| Value | Count |
|---|---|
| MONOLINGUE | 6405 |
| BILINGUE | 161 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9509595 |
| Min length | 8 |
Characters and Unicode
| Total characters | 65338 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MONOLINGUE |
|---|---|
| 2nd row | MONOLINGUE |
| 3rd row | MONOLINGUE |
| 4th row | MONOLINGUE |
| 5th row | BILINGUE |
Common Values
| Value | Count | Frequency (%) |
| MONOLINGUE | 6405 | 97.5% |
| BILINGUE | 161 | 2.5% |
Words
| Value | Count | Frequency (%) |
| monolingue | 6405 | 97.5% |
| bilingue | 161 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 12971 | 19.9% |
| O | 12810 | 19.6% |
| I | 6727 | 10.3% |
| L | 6566 | 10.0% |
| G | 6566 | 10.0% |
| U | 6566 | 10.0% |
| E | 6566 | 10.0% |
| M | 6405 | 9.8% |
| B | 161 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 65338 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 12971 | 19.9% |
| O | 12810 | 19.6% |
| I | 6727 | 10.3% |
| L | 6566 | 10.0% |
| G | 6566 | 10.0% |
| U | 6566 | 10.0% |
| E | 6566 | 10.0% |
| M | 6405 | 9.8% |
| B | 161 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 65338 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 12971 | 19.9% |
| O | 12810 | 19.6% |
| I | 6727 | 10.3% |
| L | 6566 | 10.0% |
| G | 6566 | 10.0% |
| U | 6566 | 10.0% |
| E | 6566 | 10.0% |
| M | 6405 | 9.8% |
| B | 161 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65338 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 12971 | 19.9% |
| O | 12810 | 19.6% |
| I | 6727 | 10.3% |
| L | 6566 | 10.0% |
| G | 6566 | 10.0% |
| U | 6566 | 10.0% |
| E | 6566 | 10.0% |
| M | 6405 | 9.8% |
| B | 161 | 0.2% |
JORNADA
Categorical
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
| Value | Count |
|---|---|
| DOBLE | 2208 |
| MATUTINA | 1790 |
| VESPERTINA | 1647 |
| SIN JORNADA | 636 |
| NOCTURNA | 210 |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 7.8062747 |
| Min length | 5 |
Characters and Unicode
| Total characters | 51256 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MATUTINA |
|---|---|
| 2nd row | MATUTINA |
| 3rd row | MATUTINA |
| 4th row | MATUTINA |
| 5th row | VESPERTINA |
Common Values
| Value | Count | Frequency (%) |
| DOBLE | 2208 | 33.6% |
| MATUTINA | 1790 | 27.3% |
| VESPERTINA | 1647 | 25.1% |
| SIN JORNADA | 636 | 9.7% |
| NOCTURNA | 210 | 3.2% |
| INTERMEDIA | 75 | 1.1% |
Words
| Value | Count | Frequency (%) |
| doble | 2208 | 30.7% |
| matutina | 1790 | 24.9% |
| vespertina | 1647 | 22.9% |
| sin | 636 | 8.8% |
| jornada | 636 | 8.8% |
| nocturna | 210 | 2.9% |
| intermedia | 75 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 6784 | 13.2% |
| E | 5652 | 11.0% |
| T | 5512 | 10.8% |
| N | 5204 | 10.2% |
| I | 4223 | 8.2% |
| O | 3054 | 6.0% |
| D | 2919 | 5.7% |
| R | 2568 | 5.0% |
| S | 2283 | 4.5% |
| L | 2208 | 4.3% |
| Other values (8) | 10849 | 21.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 50620 | 98.8% |
| Space Separator | 636 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6784 | 13.4% |
| E | 5652 | 11.2% |
| T | 5512 | 10.9% |
| N | 5204 | 10.3% |
| I | 4223 | 8.3% |
| O | 3054 | 6.0% |
| D | 2919 | 5.8% |
| R | 2568 | 5.1% |
| S | 2283 | 4.5% |
| L | 2208 | 4.4% |
| Other values (7) | 10213 | 20.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 636 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50620 | 98.8% |
| Common | 636 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6784 | 13.4% |
| E | 5652 | 11.2% |
| T | 5512 | 10.9% |
| N | 5204 | 10.3% |
| I | 4223 | 8.3% |
| O | 3054 | 6.0% |
| D | 2919 | 5.8% |
| R | 2568 | 5.1% |
| S | 2283 | 4.5% |
| L | 2208 | 4.4% |
| Other values (7) | 10213 | 20.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 636 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51256 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 6784 | 13.2% |
| E | 5652 | 11.0% |
| T | 5512 | 10.8% |
| N | 5204 | 10.2% |
| I | 4223 | 8.2% |
| O | 3054 | 6.0% |
| D | 2919 | 5.7% |
| R | 2568 | 5.0% |
| S | 2283 | 4.5% |
| L | 2208 | 4.3% |
| Other values (8) | 10849 | 21.2% |
PLAN
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
| Value | Count |
|---|---|
| DIARIO(REGULAR) | 3925 |
| FIN DE SEMANA | 1773 |
| SEMIPRESENCIAL (FIN DE SEMANA) | 307 |
| SEMIPRESENCIAL (UN DIA A LA SEMANA) | 258 |
| A DISTANCIA | 105 |
| Other values (8) | 198 |
Length
| Max length | 37 |
|---|---|
| Median length | 15 |
| Mean length | 16.011422 |
| Min length | 5 |
Characters and Unicode
| Total characters | 105131 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DIARIO(REGULAR) |
|---|---|
| 2nd row | DIARIO(REGULAR) |
| 3rd row | DIARIO(REGULAR) |
| 4th row | DIARIO(REGULAR) |
| 5th row | DIARIO(REGULAR) |
Common Values
| Value | Count | Frequency (%) |
| DIARIO(REGULAR) | 3925 | 59.8% |
| FIN DE SEMANA | 1773 | 27.0% |
| SEMIPRESENCIAL (FIN DE SEMANA) | 307 | 4.7% |
| SEMIPRESENCIAL (UN DIA A LA SEMANA) | 258 | 3.9% |
| A DISTANCIA | 105 | 1.6% |
| SEMIPRESENCIAL | 68 | 1.0% |
| SEMIPRESENCIAL (DOS DIAS A LA SEMANA) | 51 | 0.8% |
| SABATINO | 34 | 0.5% |
| VIRTUAL A DISTANCIA | 30 | 0.5% |
| DOMINICAL | 9 | 0.1% |
| Other values (3) | 6 | 0.1% |
Words
| Value | Count | Frequency (%) |
| diario(regular | 3925 | 30.8% |
| semana | 2389 | 18.7% |
| fin | 2080 | 16.3% |
| de | 2080 | 16.3% |
| semipresencial | 684 | 5.4% |
| a | 444 | 3.5% |
| la | 309 | 2.4% |
| un | 258 | 2.0% |
| dia | 258 | 2.0% |
| distancia | 135 | 1.1% |
| Other values (8) | 181 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 14757 | 14.0% |
| R | 12497 | 11.9% |
| I | 11965 | 11.4% |
| E | 10450 | 9.9% |
| D | 6511 | 6.2% |
| (space) | 6177 | 5.9% |
| N | 5591 | 5.3% |
| L | 4961 | 4.7% |
| ( | 4541 | 4.3% |
| ) | 4541 | 4.3% |
| Other values (12) | 23140 | 22.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 89872 | 85.5% |
| Space Separator | 6177 | 5.9% |
| Open Punctuation | 4541 | 4.3% |
| Close Punctuation | 4541 | 4.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 14757 | 16.4% |
| R | 12497 | 13.9% |
| I | 11965 | 13.3% |
| E | 10450 | 11.6% |
| D | 6511 | 7.2% |
| N | 5591 | 6.2% |
| L | 4961 | 5.5% |
| U | 4215 | 4.7% |
| S | 4028 | 4.5% |
| O | 4023 | 4.5% |
| Other values (9) | 10874 | 12.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6177 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4541 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4541 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 89872 | 85.5% |
| Common | 15259 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 14757 | 16.4% |
| R | 12497 | 13.9% |
| I | 11965 | 13.3% |
| E | 10450 | 11.6% |
| D | 6511 | 7.2% |
| N | 5591 | 6.2% |
| L | 4961 | 5.5% |
| U | 4215 | 4.7% |
| S | 4028 | 4.5% |
| O | 4023 | 4.5% |
| Other values (9) | 10874 | 12.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 6177 | 40.5% |
| ( | 4541 | 29.8% |
| ) | 4541 | 29.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 105131 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 14757 | 14.0% |
| R | 12497 | 11.9% |
| I | 11965 | 11.4% |
| E | 10450 | 9.9% |
| D | 6511 | 6.2% |
| (space) | 6177 | 5.9% |
| N | 5591 | 5.3% |
| L | 4961 | 4.7% |
| ( | 4541 | 4.3% |
| ) | 4541 | 4.3% |
| Other values (12) | 23140 | 22.0% |
DEPARTAMENTAL
Categorical
HIGH CORRELATION 
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.4 KiB |
| Value | Count |
|---|---|
| GUATEMALA NORTE | 1037 |
| GUATEMALA SUR | 796 |
| GUATEMALA OCCIDENTE | 774 |
| ESCUINTLA | 599 |
| HUEHUETENANGO | 495 |
| Other values (12) | 2865 |
Length
| Max length | 19 |
|---|---|
| Median length | 15 |
| Mean length | 12.728602 |
| Min length | 6 |
Characters and Unicode
| Total characters | 83576 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ALTA VERAPAZ |
|---|---|
| 2nd row | ALTA VERAPAZ |
| 3rd row | ALTA VERAPAZ |
| 4th row | ALTA VERAPAZ |
| 5th row | ALTA VERAPAZ |
Common Values
| Value | Count | Frequency (%) |
| GUATEMALA NORTE | 1037 | 15.8% |
| GUATEMALA SUR | 796 | 12.1% |
| GUATEMALA OCCIDENTE | 774 | 11.8% |
| ESCUINTLA | 599 | 9.1% |
| HUEHUETENANGO | 495 | 7.5% |
| SUCHITEPEQUEZ | 377 | 5.7% |
| GUATEMALA ORIENTE | 363 | 5.5% |
| IZABAL | 360 | 5.5% |
| CHIMALTENANGO | 349 | 5.3% |
| ALTA VERAPAZ | 348 | 5.3% |
| Other values (7) | 1068 | 16.3% |
Words
| Value | Count | Frequency (%) |
| guatemala | 2970 | 29.3% |
| norte | 1037 | 10.2% |
| sur | 796 | 7.9% |
| occidente | 774 | 7.6% |
| escuintla | 599 | 5.9% |
| huehuetenango | 495 | 4.9% |
| verapaz | 468 | 4.6% |
| suchitepequez | 377 | 3.7% |
| oriente | 363 | 3.6% |
| izabal | 360 | 3.6% |
| Other values (10) | 1887 | 18.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 15018 | 18.0% |
| E | 10557 | 12.6% |
| T | 7812 | 9.3% |
| U | 6773 | 8.1% |
| L | 5069 | 6.1% |
| N | 4641 | 5.6% |
| G | 3936 | 4.7% |
| I | 3576 | 4.3% |
| (space) | 3560 | 4.3% |
| M | 3491 | 4.2% |
| Other values (12) | 19143 | 22.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 80016 | 95.7% |
| Space Separator | 3560 | 4.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15018 | 18.8% |
| E | 10557 | 13.2% |
| T | 7812 | 9.8% |
| U | 6773 | 8.5% |
| L | 5069 | 6.3% |
| N | 4641 | 5.8% |
| G | 3936 | 4.9% |
| I | 3576 | 4.5% |
| M | 3491 | 4.4% |
| O | 3442 | 4.3% |
| Other values (11) | 15701 | 19.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3560 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 80016 | 95.7% |
| Common | 3560 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 15018 | 18.8% |
| E | 10557 | 13.2% |
| T | 7812 | 9.8% |
| U | 6773 | 8.5% |
| L | 5069 | 6.3% |
| N | 4641 | 5.8% |
| G | 3936 | 4.9% |
| I | 3576 | 4.5% |
| M | 3491 | 4.4% |
| O | 3442 | 4.3% |
| Other values (11) | 15701 | 19.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 3560 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 83576 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 15018 | 18.0% |
| E | 10557 | 12.6% |
| T | 7812 | 9.3% |
| U | 6773 | 8.1% |
| L | 5069 | 6.1% |
| N | 4641 | 5.6% |
| G | 3936 | 4.7% |
| I | 3576 | 4.3% |
| (space) | 3560 | 4.3% |
| M | 3491 | 4.2% |
| Other values (12) | 19143 | 22.9% |
ZONA
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 21 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 5030 |
| Missing (%) | 76.6% |
| Memory size | 51.4 KiB |
| Value | Count |
|---|---|
| ZONA 1 | 628 |
| ZONA 7 | 173 |
| ZONA 12 | 114 |
| ZONA 18 | 102 |
| ZONA 6 | 71 |
| Other values (16) | 448 |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.3255208 |
| Min length | 6 |
Characters and Unicode
| Total characters | 9716 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ZONA 1 |
|---|---|
| 2nd row | ZONA 1 |
| 3rd row | ZONA 1 |
| 4th row | ZONA 1 |
| 5th row | ZONA 1 |
Common Values
| Value | Count | Frequency (%) |
| ZONA 1 | 628 | 9.6% |
| ZONA 7 | 173 | 2.6% |
| ZONA 12 | 114 | 1.7% |
| ZONA 18 | 102 | 1.6% |
| ZONA 6 | 71 | 1.1% |
| ZONA 11 | 62 | 0.9% |
| ZONA 2 | 54 | 0.8% |
| ZONA 19 | 53 | 0.8% |
| ZONA 13 | 46 | 0.7% |
| ZONA 3 | 40 | 0.6% |
| Other values (11) | 193 | 2.9% |
| (Missing) | 5030 | 76.6% |
Words
| Value | Count | Frequency (%) |
| zona | 1536 | 50.0% |
| 1 | 628 | 20.4% |
| 7 | 173 | 5.6% |
| 12 | 114 | 3.7% |
| 18 | 102 | 3.3% |
| 6 | 71 | 2.3% |
| 11 | 62 | 2.0% |
| 2 | 54 | 1.8% |
| 19 | 53 | 1.7% |
| 13 | 46 | 1.5% |
| Other values (12) | 233 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| Z | 1536 | 15.8% |
| O | 1536 | 15.8% |
| N | 1536 | 15.8% |
| A | 1536 | 15.8% |
| (space) | 1536 | 15.8% |
| 1 | 1188 | 12.2% |
| 2 | 202 | 2.1% |
| 7 | 193 | 2.0% |
| 8 | 107 | 1.1% |
| 6 | 89 | 0.9% |
| Other values (5) | 257 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6144 | 63.2% |
| Decimal Number | 2036 | 21.0% |
| Space Separator | 1536 | 15.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1188 | 58.3% |
| 2 | 202 | 9.9% |
| 7 | 193 | 9.5% |
| 8 | 107 | 5.3% |
| 6 | 89 | 4.4% |
| 3 | 86 | 4.2% |
| 9 | 81 | 4.0% |
| 5 | 44 | 2.2% |
| 0 | 27 | 1.3% |
| 4 | 19 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 1536 | 25.0% |
| O | 1536 | 25.0% |
| N | 1536 | 25.0% |
| A | 1536 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1536 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6144 | 63.2% |
| Common | 3572 | 36.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 1536 | 43.0% |
| 1 | 1188 | 33.3% |
| 2 | 202 | 5.7% |
| 7 | 193 | 5.4% |
| 8 | 107 | 3.0% |
| 6 | 89 | 2.5% |
| 3 | 86 | 2.4% |
| 9 | 81 | 2.3% |
| 5 | 44 | 1.2% |
| 0 | 27 | 0.8% |
Latin
| Value | Count | Frequency (%) |
| Z | 1536 | 25.0% |
| O | 1536 | 25.0% |
| N | 1536 | 25.0% |
| A | 1536 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9716 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Z | 1536 | 15.8% |
| O | 1536 | 15.8% |
| N | 1536 | 15.8% |
| A | 1536 | 15.8% |
| (space) | 1536 | 15.8% |
| 1 | 1188 | 12.2% |
| 2 | 202 | 2.1% |
| 7 | 193 | 2.0% |
| 8 | 107 | 1.1% |
| 6 | 89 | 0.9% |
| Other values (5) | 257 | 2.6% |
Correlations
| | Unnamed: 0 | DEPARTAMENTO | SECTOR | AREA | STATUS | MODALIDAD | JORNADA | PLAN | DEPARTAMENTAL | ZONA |
|---|---|---|---|---|---|---|---|---|---|---|
| Unnamed: 0 | 1.000 | 0.850 | 0.085 | 0.158 | 0.104 | 0.109 | 0.124 | 0.119 | 0.882 | 0.977 |
| DEPARTAMENTO | 0.850 | 1.000 | 0.146 | 0.186 | 0.125 | 0.296 | 0.125 | 0.107 | 1.000 | 1.000 |
| SECTOR | 0.085 | 0.146 | 1.000 | 0.129 | 0.071 | 0.113 | 0.131 | 0.126 | 0.150 | 0.225 |
| AREA | 0.158 | 0.186 | 0.129 | 1.000 | 0.035 | 0.090 | 0.077 | 0.067 | 0.205 | 0.265 |
| STATUS | 0.104 | 0.125 | 0.071 | 0.035 | 1.000 | 0.021 | 0.164 | 0.139 | 0.134 | 0.126 |
| MODALIDAD | 0.109 | 0.296 | 0.113 | 0.090 | 0.021 | 1.000 | 0.092 | 0.084 | 0.295 | 0.000 |
| JORNADA | 0.124 | 0.125 | 0.131 | 0.077 | 0.164 | 0.092 | 1.000 | 0.560 | 0.134 | 0.093 |
| PLAN | 0.119 | 0.107 | 0.126 | 0.067 | 0.139 | 0.084 | 0.560 | 1.000 | 0.116 | 0.065 |
| DEPARTAMENTAL | 0.882 | 1.000 | 0.150 | 0.205 | 0.134 | 0.295 | 0.134 | 0.116 | 1.000 | 0.994 |
| ZONA | 0.977 | 1.000 | 0.225 | 0.265 | 0.126 | 0.000 | 0.093 | 0.065 | 0.994 | 1.000 |
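The matrix above does not state which association measure produced its values. For pairs of categorical columns such as JORNADA and PLAN, one widely used measure is Cramér's V, which rescales the chi-squared statistic of the contingency table to the range 0 to 1. The sketch below implements the plain, uncorrected form as an illustration only. The function name `cramers_v` and the toy rows are invented for the example, so it is not expected to reproduce the 0.560 shown for JORNADA and PLAN.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    rows = Counter(xs)             # marginal counts of the first variable
    cols = Counter(ys)             # marginal counts of the second variable

    # Chi-squared statistic: sum over every cell of (observed - expected)^2 / expected.
    chi2 = 0.0
    for r, r_count in rows.items():
        for c, c_count in cols.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(rows), len(cols)) - 1
    if k == 0:
        return 0.0  # a constant column (such as NIVEL) carries no association
    return math.sqrt(chi2 / (n * k))


# Toy rows, invented for illustration (not taken from the dataset).
jornada = ["MATUTINA", "MATUTINA", "DOBLE", "DOBLE"]
plan_dependent = ["DIARIO(REGULAR)", "DIARIO(REGULAR)", "FIN DE SEMANA", "FIN DE SEMANA"]
plan_independent = ["DIARIO(REGULAR)", "FIN DE SEMANA", "DIARIO(REGULAR)", "FIN DE SEMANA"]

print(cramers_v(jornada, plan_dependent))    # 1.0: PLAN is fully determined by JORNADA
print(cramers_v(jornada, plan_independent))  # 0.0: no association
```

Read this way, the values of 1.000 between DEPARTAMENTO and DEPARTAMENTAL or ZONA suggest that one column is almost fully determined by the other, so keeping both adds little information.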
First rows
| | Unnamed: 0 | CODIGO | DISTRITO | DEPARTAMENTO | MUNICIPIO | ESTABLECIMIENTO | DIRECCION | TELEFONO | SUPERVISOR | DIRECTOR | NIVEL | SECTOR | AREA | STATUS | MODALIDAD | JORNADA | PLAN | DEPARTAMENTAL | CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN DEPARTA | CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN | CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN | CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN DE | ZONA |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 16-01-0138-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO COBAN | KM.2 SALIDA A SAN JUAN CHAMELCO ZONA 8 | 77945104 | MERCEDES JOSEFINA TORRES GALVEZ | JULIO CESAR VILLELA AMADO | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 1 | 1 | 16-01-0139-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO PARTICULAR MIXTO VERAPAZ | KM 209.5 ENTRADA A LA CIUDAD | 77367402 | MERCEDES JOSEFINA TORRES GALVEZ | NaN | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 2 | 2 | 16-01-0140-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO "LA INMACULADA" | 7A. AVENIDA 11-109 ZONA 6 | 78232301 | MERCEDES JOSEFINA TORRES GALVEZ | VIRGINA SOLANO SERRANO | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 3 | 3 | 16-01-0141-46 | 16-005 | ALTA VERAPAZ | COBAN | ESCUELA NACIONAL DE CIENCIAS COMERCIALES | 2A CALLE 11-10 ZONA 2 | 79514215 | RUDY ADOLFO TOT OCH | NaN | DIVERSIFICADO | OFICIAL | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 4 | 4 | 16-01-0142-46 | 16-005 | ALTA VERAPAZ | COBAN | INSTITUTO NORMAL MIXTO DEL NORTE 'EMILIO ROSALES PONCE' | 3A AVE 6-23 ZONA 11 | 79521468 | RUDY ADOLFO TOT OCH | NaN | DIVERSIFICADO | OFICIAL | URBANA | ABIERTA | BILINGUE | VESPERTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 5 | 5 | 16-01-0143-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO PARTICULAR MIXTO IMPERIAL | 5A. CALLE 1-9 ZONA 3 | 57101061 | MERCEDES JOSEFINA TORRES GALVEZ | HECOTR WALDEMAR TOT COY | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | DOBLE | FIN DE SEMANA | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 6 | 6 | 16-01-0145-46 | 16-006 | ALTA VERAPAZ | COBAN | INSTITUTO DE TURSMO Y AVIACON DEL NORTE I.T.A.N | 3 AV. 5-28 ZONA 4 | 54641454 | EFRAIN CAAL CUC | LUIS FERNANDO SOTO | DIVERSIFICADO | PRIVADO | URBANA | CERRADA TEMPORALMENTE | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 7 | 7 | 16-01-0147-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO "LA INMACULADA" | 7A. CALLE 11-09 ZONA 6 COBAN | 49532425 | MERCEDES JOSEFINA TORRES GALVEZ | MERCEDES QUIROS QUIROS | DIVERSIFICADO | PRIVADO | RURAL | CERRADA TEMPORALMENTE | MONOLINGUE | DOBLE | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 8 | 8 | 16-01-0150-46 | 16-006 | ALTA VERAPAZ | COBAN | INSTITUTO INTERCULTRUAL ALTAVERAPACENCESE -IIAV- | 3A. AVAENIDA 1-23 ZONA 4 | NaN | EFRAIN CAAL CUC | GUILLERMO ESTUARDO VASQUEZ MORALES | DIVERSIFICADO | PRIVADO | URBANA | CERRADA TEMPORALMENTE | BILINGUE | DOBLE | FIN DE SEMANA | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
| 9 | 9 | 16-01-0155-46 | 16-031 | ALTA VERAPAZ | COBAN | LICEO "MODERNO LATINO" | 11 AVENIDA 5-17 ZONA 4 | 79522555 | MERCEDES JOSEFINA TORRES GALVEZ | JORGE BENEDICTO COC POP | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | DOBLE | FIN DE SEMANA | ALTA VERAPAZ | NaN | NaN | NaN | NaN | NaN |
Last rows
| | Unnamed: 0 | CODIGO | DISTRITO | DEPARTAMENTO | MUNICIPIO | ESTABLECIMIENTO | DIRECCION | TELEFONO | SUPERVISOR | DIRECTOR | NIVEL | SECTOR | AREA | STATUS | MODALIDAD | JORNADA | PLAN | DEPARTAMENTAL | CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN DEPARTA | CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN | CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN | CODIGO DISTRITO DEPARTAMENTO MUNICIPIO ESTABLECIMIENTO DIRECCION TELEFONO SUPERVISOR DIRECTOR NIVEL SECTOR AREA STATUS MODALIDAD JORNADA PLAN DE | ZONA |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6556 | 15974 | 19-08-0011-46 | 19-013 | ZACAPA | SAN DIEGO | INSTITUTO NACIONAL DE EDUCACION DIVERSIFICADA | BARRIO EL CENTRO | NaN | DOUGLAS DONALDO URRUTIA MATEO | VIVIAN ILIANA MIRANDA DIAZ | DIVERSIFICADO | OFICIAL | URBANA | ABIERTA | MONOLINGUE | NOCTURNA | DIARIO(REGULAR) | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6557 | 15975 | 19-08-0890-46 | 19-013 | ZACAPA | SAN DIEGO | INSTITUTO DIVERSIFICADO POR COOPERATIVA PROF. CARLOS ROBERTO DONIS OSORIO | BARRIO EL CENTRO | NaN | DOUGLAS DONALDO URRUTIA MATEO | LEONEL AUGUSTO LEMUS MOSCOSO | DIVERSIFICADO | COOPERATIVA | URBANA | ABIERTA | MONOLINGUE | VESPERTINA | DIARIO(REGULAR) | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6558 | 15976 | 19-09-0008-46 | 19-014 | ZACAPA | LA UNION | INSTITUTO NACIONAL DE EDUCACION DIVERSIFICADA | BARRIO NUEVO | NaN | WILBER OBDULIO MEJIA SUCHITE | GUSTAVO LEIVA MORALES | DIVERSIFICADO | OFICIAL | URBANA | ABIERTA | MONOLINGUE | VESPERTINA | DIARIO(REGULAR) | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6559 | 15977 | 19-09-0034-46 | 19-021 | ZACAPA | LA UNION | LICEO PARTICULAR MIXTO "JIREH" | BARRIO NUEVO | NaN | BERTA ALICIA LEIVA CORDON DE GARCIA | ANA MARIA CUELLAR GUERRA DE RAMIREZ | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | DOBLE | FIN DE SEMANA | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6560 | 15978 | 19-09-0037-46 | 19-021 | ZACAPA | LA UNION | LICEO PARTICULAR MIXTO "JIREH" | BARRIO NUEVO | NaN | BERTA ALICIA LEIVA CORDON DE GARCIA | ANA MARIA CUELLAR GUERRA DE RAMIREZ | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | DOBLE | DIARIO(REGULAR) | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6561 | 15979 | 19-09-0040-46 | 19-021 | ZACAPA | LA UNION | LICEO PARTICULAR MIXTO "JIREH" | BARRIO NUEVO | NaN | BERTA ALICIA LEIVA CORDON DE GARCIA | ANA MARIA CUELLAR GUERRA DE RAMIREZ | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6562 | 15980 | 19-09-0048-46 | 19-021 | ZACAPA | LA UNION | LICEO PARTICULAR MIXTO " JIREH" | BARRIO NUEVO | NaN | BERTA ALICIA LEIVA CORDON DE GARCIA | ANA MARIA CUELLAR GUERRA DE RAMIREZ | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | SIN JORNADA | SEMIPRESENCIAL (UN DIA A LA SEMANA) | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6563 | 15981 | 19-10-0013-46 | 19-015 | ZACAPA | HUITE | INSTITUTO DIVERSIFICADO | BARRIO BUENOS AIRES | NaN | YADIRA FERNANDA SOSA GUERRA | MARLON JOSUE ARCHILA LORENZO | DIVERSIFICADO | OFICIAL | URBANA | ABIERTA | MONOLINGUE | NOCTURNA | DIARIO(REGULAR) | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6564 | 15982 | 19-10-1009-46 | 19-015 | ZACAPA | HUITE | INSTITUTO MIXTO DE EDUCACION DIVERSIFICADA POR COOPERATIVA DE ENSENANZA | BARRIO EL CAMPO | NaN | YADIRA FERNANDA SOSA GUERRA | ROBIDIO PORTILLO SALGUERO | DIVERSIFICADO | COOPERATIVA | URBANA | ABIERTA | MONOLINGUE | VESPERTINA | DIARIO(REGULAR) | ZACAPA | NaN | NaN | NaN | NaN | NaN |
| 6565 | 15983 | 19-11-0018-46 | 19-020 | ZACAPA | SAN JORGE | INSTITUTO MIXTO DE EDUCACION DIVERSIFICADA POR COOPERATIVA | BARRIO EL CENTRO | NaN | ALBA LUZ MENDEZ | VICTOR HUGO GUERRA MONROY | DIVERSIFICADO | COOPERATIVA | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ZACAPA | NaN | NaN | NaN | NaN | NaN |